Overview

Brought to you by YData

Dataset statistics

Number of variables16
Number of observations999
Missing cells428
Missing cells (%)2.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory760.9 KiB
Average record size in memory780.0 B

Variable types

Numeric7
Text8
Categorical1

Alerts

Rating is highly overall correlated with Unnamed: 0High correlation
Revenue is highly overall correlated with VotesHigh correlation
Unnamed: 0 is highly overall correlated with RatingHigh correlation
Votes is highly overall correlated with RevenueHigh correlation
Certificate has 101 (10.1%) missing values Missing
scoreAvg has 157 (15.7%) missing values Missing
Revenue has 169 (16.9%) missing values Missing
Unnamed: 0 is uniformly distributed Uniform
Unnamed: 0 has unique values Unique
Overview has unique values Unique

Reproduction

Analysis started2025-09-02 13:50:10.382906
Analysis finished2025-09-02 13:50:39.836286
Duration29.45 seconds
Software versionydata-profiling vv4.16.1
Download configurationconfig.json

Variables

Unnamed: 0
Real number (ℝ)

High correlation  Uniform  Unique 

Distinct999
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean500
Minimum1
Maximum999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2025-09-02T10:50:40.188115image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile50.9
Q1250.5
median500
Q3749.5
95-th percentile949.1
Maximum999
Range998
Interquartile range (IQR)499

Descriptive statistics

Standard deviation288.53076
Coefficient of variation (CV)0.57706152
Kurtosis-1.2
Mean500
Median Absolute Deviation (MAD)250
Skewness0
Sum499500
Variance83250
MonotonicityStrictly increasing
2025-09-02T10:50:40.652486image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
999 1
 
0.1%
1 1
 
0.1%
2 1
 
0.1%
3 1
 
0.1%
4 1
 
0.1%
5 1
 
0.1%
6 1
 
0.1%
983 1
 
0.1%
982 1
 
0.1%
981 1
 
0.1%
Other values (989) 989
99.0%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
999 1
0.1%
998 1
0.1%
997 1
0.1%
996 1
0.1%
995 1
0.1%
994 1
0.1%
993 1
0.1%
992 1
0.1%
991 1
0.1%
990 1
0.1%

Title
Text

Distinct998
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size73.0 KiB
2025-09-02T10:50:41.617396image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length68
Median length41
Mean length15.443443
Min length2

Characters and Unicode

Total characters15428
Distinct characters100
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique997 ?
Unique (%)99.8%

Sample

1st rowThe Godfather
2nd rowThe Dark Knight
3rd rowThe Godfather: Part II
4th row12 Angry Men
5th rowThe Lord of the Rings: The Return of the King
ValueCountFrequency (%)
the 274
 
9.8%
of 86
 
3.1%
a 32
 
1.2%
and 28
 
1.0%
no 24
 
0.9%
la 23
 
0.8%
in 22
 
0.8%
to 18
 
0.6%
de 17
 
0.6%
man 17
 
0.6%
Other values (1664) 2241
80.6%
2025-09-02T10:50:43.262734image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1783
 
11.6%
e 1425
 
9.2%
a 1126
 
7.3%
o 965
 
6.3%
n 921
 
6.0%
i 861
 
5.6%
r 816
 
5.3%
t 755
 
4.9%
h 564
 
3.7%
s 562
 
3.6%
Other values (90) 5650
36.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 11162
72.3%
Uppercase Letter 2191
 
14.2%
Space Separator 1783
 
11.6%
Other Punctuation 177
 
1.1%
Decimal Number 79
 
0.5%
Dash Punctuation 31
 
0.2%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Other Number 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 1425
12.8%
a 1126
10.1%
o 965
 
8.6%
n 921
 
8.3%
i 861
 
7.7%
r 816
 
7.3%
t 755
 
6.8%
h 564
 
5.1%
s 562
 
5.0%
l 514
 
4.6%
Other values (38) 2653
23.8%
Uppercase Letter
ValueCountFrequency (%)
T 283
 
12.9%
S 187
 
8.5%
B 157
 
7.2%
M 139
 
6.3%
D 129
 
5.9%
L 119
 
5.4%
A 113
 
5.2%
C 101
 
4.6%
H 98
 
4.5%
P 97
 
4.4%
Other values (18) 768
35.1%
Decimal Number
ValueCountFrequency (%)
2 23
29.1%
1 15
19.0%
0 11
13.9%
3 8
 
10.1%
4 5
 
6.3%
7 5
 
6.3%
9 4
 
5.1%
5 4
 
5.1%
8 2
 
2.5%
6 2
 
2.5%
Other Punctuation
ValueCountFrequency (%)
: 62
35.0%
. 47
26.6%
' 32
18.1%
, 16
 
9.0%
! 7
 
4.0%
& 6
 
3.4%
? 3
 
1.7%
/ 3
 
1.7%
· 1
 
0.6%
Space Separator
ValueCountFrequency (%)
1783
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Other Number
ValueCountFrequency (%)
½ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 13353
86.6%
Common 2075
 
13.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 1425
 
10.7%
a 1126
 
8.4%
o 965
 
7.2%
n 921
 
6.9%
i 861
 
6.4%
r 816
 
6.1%
t 755
 
5.7%
h 564
 
4.2%
s 562
 
4.2%
l 514
 
3.8%
Other values (66) 4844
36.3%
Common
ValueCountFrequency (%)
1783
85.9%
: 62
 
3.0%
. 47
 
2.3%
' 32
 
1.5%
- 31
 
1.5%
2 23
 
1.1%
, 16
 
0.8%
1 15
 
0.7%
0 11
 
0.5%
3 8
 
0.4%
Other values (14) 47
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15362
99.6%
None 66
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1783
 
11.6%
e 1425
 
9.3%
a 1126
 
7.3%
o 965
 
6.3%
n 921
 
6.0%
i 861
 
5.6%
r 816
 
5.3%
t 755
 
4.9%
h 564
 
3.7%
s 562
 
3.7%
Other values (64) 5584
36.3%
None
ValueCountFrequency (%)
ô 14
21.2%
é 6
 
9.1%
û 5
 
7.6%
è 5
 
7.6%
â 5
 
7.6%
ä 4
 
6.1%
î 2
 
3.0%
ù 2
 
3.0%
ü 2
 
3.0%
á 2
 
3.0%
Other values (16) 19
28.8%

Year
Real number (ℝ)

Distinct99
Distinct (%)9.9%
Missing1
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean1991.2144
Minimum1920
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2025-09-02T10:50:43.595081image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1920
5-th percentile1944
Q11976
median1999
Q32009
95-th percentile2017
Maximum2020
Range100
Interquartile range (IQR)33

Descriptive statistics

Standard deviation23.308539
Coefficient of variation (CV)0.01170569
Kurtosis-0.02478235
Mean1991.2144
Median Absolute Deviation (MAD)14
Skewness-0.93854006
Sum1987232
Variance543.28798
MonotonicityNot monotonic
2025-09-02T10:50:44.255603image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2014 32
 
3.2%
2004 31
 
3.1%
2009 29
 
2.9%
2013 28
 
2.8%
2016 28
 
2.8%
2001 27
 
2.7%
2006 26
 
2.6%
2007 26
 
2.6%
2015 25
 
2.5%
2012 24
 
2.4%
Other values (89) 722
72.3%
ValueCountFrequency (%)
1920 1
 
0.1%
1921 1
 
0.1%
1922 1
 
0.1%
1924 1
 
0.1%
1925 2
0.2%
1926 1
 
0.1%
1927 2
0.2%
1928 2
0.2%
1930 1
 
0.1%
1931 3
0.3%
ValueCountFrequency (%)
2020 6
 
0.6%
2019 23
2.3%
2018 19
1.9%
2017 22
2.2%
2016 28
2.8%
2015 25
2.5%
2014 32
3.2%
2013 28
2.8%
2012 24
2.4%
2011 18
1.8%

Certificate
Categorical

Missing 

Distinct16
Distinct (%)1.8%
Missing101
Missing (%)10.1%
Memory size2.6 KiB
U
234 
A
196 
UA
175 
R
146 
PG-13
43 
Other values (11)
104 

Length

Max length8
Median length1
Mean length1.7371938
Min length1

Characters and Unicode

Total characters1560
Distinct characters24
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)0.6%

Sample

1st rowA
2nd rowUA
3rd rowA
4th rowU
5th rowU

Common Values

ValueCountFrequency (%)
U 234
23.4%
A 196
19.6%
UA 175
17.5%
R 146
14.6%
PG-13 43
 
4.3%
PG 37
 
3.7%
Passed 34
 
3.4%
G 12
 
1.2%
Approved 11
 
1.1%
TV-PG 3
 
0.3%
Other values (6) 7
 
0.7%
(Missing) 101
10.1%

Length

2025-09-02T10:50:44.710787image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
u 234
26.1%
a 196
21.8%
ua 175
19.5%
r 146
16.3%
pg-13 43
 
4.8%
pg 37
 
4.1%
passed 34
 
3.8%
g 12
 
1.3%
approved 11
 
1.2%
tv-pg 3
 
0.3%
Other values (6) 7
 
0.8%

Most occurring characters

ValueCountFrequency (%)
U 411
26.3%
A 384
24.6%
R 146
 
9.4%
P 119
 
7.6%
G 97
 
6.2%
s 68
 
4.4%
- 48
 
3.1%
e 46
 
2.9%
d 46
 
2.9%
1 45
 
2.9%
Other values (14) 150
 
9.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 1168
74.9%
Lowercase Letter 253
 
16.2%
Decimal Number 90
 
5.8%
Dash Punctuation 48
 
3.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 68
26.9%
e 46
18.2%
d 46
18.2%
a 35
13.8%
p 22
 
8.7%
r 12
 
4.7%
o 11
 
4.3%
v 11
 
4.3%
n 1
 
0.4%
t 1
 
0.4%
Uppercase Letter
ValueCountFrequency (%)
U 411
35.2%
A 384
32.9%
R 146
 
12.5%
P 119
 
10.2%
G 97
 
8.3%
T 5
 
0.4%
V 5
 
0.4%
M 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 45
50.0%
3 43
47.8%
6 1
 
1.1%
4 1
 
1.1%
Dash Punctuation
ValueCountFrequency (%)
- 48
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1421
91.1%
Common 139
 
8.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
U 411
28.9%
A 384
27.0%
R 146
 
10.3%
P 119
 
8.4%
G 97
 
6.8%
s 68
 
4.8%
e 46
 
3.2%
d 46
 
3.2%
a 35
 
2.5%
p 22
 
1.5%
Other values (8) 47
 
3.3%
Common
ValueCountFrequency (%)
- 48
34.5%
1 45
32.4%
3 43
30.9%
6 1
 
0.7%
4 1
 
0.7%
/ 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1560
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
U 411
26.3%
A 384
24.6%
R 146
 
9.4%
P 119
 
7.6%
G 97
 
6.2%
s 68
 
4.4%
- 48
 
3.1%
e 46
 
2.9%
d 46
 
2.9%
1 45
 
2.9%
Other values (14) 150
 
9.6%

Runtime
Real number (ℝ)

Distinct140
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean122.87187
Minimum45
Maximum321
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2025-09-02T10:50:45.269740image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum45
5-th percentile87
Q1103
median119
Q3137
95-th percentile178
Maximum321
Range276
Interquartile range (IQR)34

Descriptive statistics

Standard deviation28.101227
Coefficient of variation (CV)0.2287035
Kurtosis3.4289066
Mean122.87187
Median Absolute Deviation (MAD)17
Skewness1.2098771
Sum122749
Variance789.67896
MonotonicityNot monotonic
2025-09-02T10:50:45.776336image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100 23
 
2.3%
130 23
 
2.3%
129 22
 
2.2%
101 22
 
2.2%
113 22
 
2.2%
110 20
 
2.0%
122 20
 
2.0%
108 19
 
1.9%
102 18
 
1.8%
96 17
 
1.7%
Other values (130) 793
79.4%
ValueCountFrequency (%)
45 1
 
0.1%
64 1
 
0.1%
67 1
 
0.1%
68 1
 
0.1%
69 1
 
0.1%
70 1
 
0.1%
71 2
0.2%
72 2
0.2%
75 2
0.2%
76 3
0.3%
ValueCountFrequency (%)
321 1
0.1%
242 1
0.1%
238 1
0.1%
229 1
0.1%
228 1
0.1%
224 1
0.1%
220 1
0.1%
212 1
0.1%
210 1
0.1%
209 1
0.1%

Genre
Text

Distinct202
Distinct (%)20.2%
Missing0
Missing (%)0.0%
Memory size74.3 KiB
2025-09-02T10:50:46.429171image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length29
Median length24
Mean length19.077077
Min length5

Characters and Unicode

Total characters19058
Distinct characters33
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)7.2%

Sample

1st rowCrime, Drama
2nd rowAction, Crime, Drama
3rd rowCrime, Drama
4th rowCrime, Drama
5th rowAction, Adventure, Drama
ValueCountFrequency (%)
drama 723
28.5%
comedy 233
 
9.2%
crime 209
 
8.2%
adventure 196
 
7.7%
action 189
 
7.4%
thriller 137
 
5.4%
romance 125
 
4.9%
biography 109
 
4.3%
mystery 99
 
3.9%
animation 82
 
3.2%
Other values (11) 438
17.2%
2025-09-02T10:50:47.351780image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 2018
 
10.6%
r 1871
 
9.8%
, 1541
 
8.1%
1541
 
8.1%
m 1447
 
7.6%
e 1235
 
6.5%
i 1144
 
6.0%
o 896
 
4.7%
n 760
 
4.0%
t 727
 
3.8%
Other values (23) 5878
30.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 13264
69.6%
Uppercase Letter 2626
 
13.8%
Other Punctuation 1541
 
8.1%
Space Separator 1541
 
8.1%
Dash Punctuation 86
 
0.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 2018
15.2%
r 1871
14.1%
m 1447
10.9%
e 1235
9.3%
i 1144
8.6%
o 896
6.8%
n 760
 
5.7%
t 727
 
5.5%
y 718
 
5.4%
c 433
 
3.3%
Other values (8) 2015
15.2%
Uppercase Letter
ValueCountFrequency (%)
D 723
27.5%
A 467
17.8%
C 442
16.8%
F 208
 
7.9%
M 151
 
5.8%
T 137
 
5.2%
R 125
 
4.8%
B 109
 
4.2%
H 88
 
3.4%
S 86
 
3.3%
Other values (2) 90
 
3.4%
Other Punctuation
ValueCountFrequency (%)
, 1541
100.0%
Space Separator
ValueCountFrequency (%)
1541
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 86
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 15890
83.4%
Common 3168
 
16.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 2018
12.7%
r 1871
11.8%
m 1447
 
9.1%
e 1235
 
7.8%
i 1144
 
7.2%
o 896
 
5.6%
n 760
 
4.8%
t 727
 
4.6%
D 723
 
4.6%
y 718
 
4.5%
Other values (20) 4351
27.4%
Common
ValueCountFrequency (%)
, 1541
48.6%
1541
48.6%
- 86
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 19058
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 2018
 
10.6%
r 1871
 
9.8%
, 1541
 
8.1%
1541
 
8.1%
m 1447
 
7.6%
e 1235
 
6.5%
i 1144
 
6.0%
o 896
 
4.7%
n 760
 
4.0%
t 727
 
3.8%
Other values (23) 5878
30.8%

Rating
Real number (ℝ)

High correlation 

Distinct16
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.9479479
Minimum7.6
Maximum9.2
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2025-09-02T10:50:47.618602image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum7.6
5-th percentile7.6
Q17.7
median7.9
Q38.1
95-th percentile8.5
Maximum9.2
Range1.6
Interquartile range (IQR)0.4

Descriptive statistics

Standard deviation0.27228951
Coefficient of variation (CV)0.034259096
Kurtosis1.0583968
Mean7.9479479
Median Absolute Deviation (MAD)0.2
Skewness0.94669269
Sum7940
Variance0.074141576
MonotonicityDecreasing
2025-09-02T10:50:48.059768image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
7.7 157
15.7%
7.8 151
15.1%
8 141
14.1%
8.1 127
12.7%
7.6 123
12.3%
7.9 106
10.6%
8.2 67
6.7%
8.3 44
 
4.4%
8.4 31
 
3.1%
8.5 20
 
2.0%
Other values (6) 32
 
3.2%
ValueCountFrequency (%)
7.6 123
12.3%
7.7 157
15.7%
7.8 151
15.1%
7.9 106
10.6%
8 141
14.1%
8.1 127
12.7%
8.2 67
6.7%
8.3 44
 
4.4%
8.4 31
 
3.1%
8.5 20
 
2.0%
ValueCountFrequency (%)
9.2 1
 
0.1%
9 3
 
0.3%
8.9 3
 
0.3%
8.8 5
 
0.5%
8.7 5
 
0.5%
8.6 15
 
1.5%
8.5 20
 
2.0%
8.4 31
3.1%
8.3 44
4.4%
8.2 67
6.7%

Overview
Text

Unique 

Distinct999
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size201.4 KiB
2025-09-02T10:50:49.295594image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length313
Median length197
Mean length146.28328
Min length40

Characters and Unicode

Total characters146137
Distinct characters86
Distinct categories11 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique999 ?
Unique (%)100.0%

Sample

1st rowAn organized crime dynasty's aging patriarch transfers control of his clandestine empire to his reluctant son.
2nd rowWhen the menace known as the Joker wreaks havoc and chaos on the people of Gotham, Batman must accept one of the greatest psychological and physical tests of his ability to fight injustice.
3rd rowThe early life and career of Vito Corleone in 1920s New York City is portrayed, while his son, Michael, expands and tightens his grip on the family crime syndicate.
4th rowA jury holdout attempts to prevent a miscarriage of justice by forcing his colleagues to reconsider the evidence.
5th rowGandalf and Aragorn lead the World of Men against Sauron's army to draw his gaze from Frodo and Sam as they approach Mount Doom with the One Ring.
ValueCountFrequency (%)
a 1609
 
6.4%
the 1206
 
4.8%
to 803
 
3.2%
of 777
 
3.1%
and 696
 
2.8%
in 565
 
2.3%
his 516
 
2.1%
an 291
 
1.2%
is 245
 
1.0%
with 242
 
1.0%
Other values (5878) 18034
72.2%
2025-09-02T10:50:51.289944image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23999
16.4%
e 13867
 
9.5%
a 9800
 
6.7%
t 9329
 
6.4%
i 8842
 
6.1%
n 8580
 
5.9%
o 8559
 
5.9%
r 8202
 
5.6%
s 7965
 
5.5%
h 5625
 
3.8%
Other values (76) 41369
28.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 114964
78.7%
Space Separator 24000
 
16.4%
Uppercase Letter 3515
 
2.4%
Other Punctuation 2721
 
1.9%
Decimal Number 509
 
0.3%
Dash Punctuation 395
 
0.3%
Open Punctuation 13
 
< 0.1%
Close Punctuation 13
 
< 0.1%
Currency Symbol 4
 
< 0.1%
Final Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 13867
12.1%
a 9800
 
8.5%
t 9329
 
8.1%
i 8842
 
7.7%
n 8580
 
7.5%
o 8559
 
7.4%
r 8202
 
7.1%
s 7965
 
6.9%
h 5625
 
4.9%
l 4847
 
4.2%
Other values (23) 29348
25.5%
Uppercase Letter
ValueCountFrequency (%)
A 712
20.3%
T 258
 
7.3%
I 258
 
7.3%
W 228
 
6.5%
S 223
 
6.3%
B 176
 
5.0%
M 167
 
4.8%
C 158
 
4.5%
H 139
 
4.0%
R 119
 
3.4%
Other values (17) 1077
30.6%
Decimal Number
ValueCountFrequency (%)
1 117
23.0%
0 104
20.4%
9 94
18.5%
2 43
 
8.4%
6 33
 
6.5%
7 30
 
5.9%
5 26
 
5.1%
8 23
 
4.5%
4 21
 
4.1%
3 18
 
3.5%
Other Punctuation
ValueCountFrequency (%)
. 1278
47.0%
, 1082
39.8%
' 260
 
9.6%
" 60
 
2.2%
: 16
 
0.6%
? 11
 
0.4%
/ 8
 
0.3%
; 6
 
0.2%
Space Separator
ValueCountFrequency (%)
23999
> 99.9%
  1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 395
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 4
100.0%
Final Punctuation
ValueCountFrequency (%)
» 2
100.0%
Initial Punctuation
ValueCountFrequency (%)
« 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 118479
81.1%
Common 27658
 
18.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 13867
11.7%
a 9800
 
8.3%
t 9329
 
7.9%
i 8842
 
7.5%
n 8580
 
7.2%
o 8559
 
7.2%
r 8202
 
6.9%
s 7965
 
6.7%
h 5625
 
4.7%
l 4847
 
4.1%
Other values (50) 32863
27.7%
Common
ValueCountFrequency (%)
23999
86.8%
. 1278
 
4.6%
, 1082
 
3.9%
- 395
 
1.4%
' 260
 
0.9%
1 117
 
0.4%
0 104
 
0.4%
9 94
 
0.3%
" 60
 
0.2%
2 43
 
0.2%
Other values (16) 226
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 146116
> 99.9%
None 21
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23999
16.4%
e 13867
 
9.5%
a 9800
 
6.7%
t 9329
 
6.4%
i 8842
 
6.1%
n 8580
 
5.9%
o 8559
 
5.9%
r 8202
 
5.6%
s 7965
 
5.5%
h 5625
 
3.8%
Other values (65) 41348
28.3%
None
ValueCountFrequency (%)
é 9
42.9%
» 2
 
9.5%
è 2
 
9.5%
ü 1
 
4.8%
  1
 
4.8%
ä 1
 
4.8%
ç 1
 
4.8%
« 1
 
4.8%
ö 1
 
4.8%
É 1
 
4.8%

scoreAvg
Real number (ℝ)

Missing 

Distinct63
Distinct (%)7.5%
Missing157
Missing (%)15.7%
Infinite0
Infinite (%)0.0%
Mean77.969121
Minimum28
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.9 KiB
2025-09-02T10:50:51.683171image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum28
5-th percentile56
Q170
median79
Q387
95-th percentile96
Maximum100
Range72
Interquartile range (IQR)17

Descriptive statistics

Standard deviation12.383257
Coefficient of variation (CV)0.15882258
Kurtosis0.41651678
Mean77.969121
Median Absolute Deviation (MAD)8
Skewness-0.60431623
Sum65650
Variance153.34506
MonotonicityNot monotonic
2025-09-02T10:50:52.287079image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
76 32
 
3.2%
84 29
 
2.9%
90 29
 
2.9%
86 27
 
2.7%
72 27
 
2.7%
73 27
 
2.7%
85 27
 
2.7%
77 26
 
2.6%
80 26
 
2.6%
81 26
 
2.6%
Other values (53) 566
56.7%
(Missing) 157
 
15.7%
ValueCountFrequency (%)
28 1
 
0.1%
30 1
 
0.1%
33 1
 
0.1%
36 1
 
0.1%
40 1
 
0.1%
41 1
 
0.1%
44 1
 
0.1%
45 3
0.3%
46 1
 
0.1%
47 4
0.4%
ValueCountFrequency (%)
100 12
1.2%
99 4
 
0.4%
98 9
0.9%
97 12
1.2%
96 18
1.8%
95 11
1.1%
94 20
2.0%
93 14
1.4%
92 13
1.3%
91 19
1.9%
Distinct548
Distinct (%)54.9%
Missing0
Missing (%)0.0%
Memory size70.7 KiB
2025-09-02T10:50:53.658387image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length32
Median length22
Mean length13.485485
Min length7

Characters and Unicode

Total characters13472
Distinct characters69
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique353 ?
Unique (%)35.3%

Sample

1st rowFrancis Ford Coppola
2nd rowChristopher Nolan
3rd rowFrancis Ford Coppola
4th rowSidney Lumet
5th rowPeter Jackson
ValueCountFrequency (%)
john 34
 
1.6%
david 28
 
1.4%
james 23
 
1.1%
robert 20
 
1.0%
martin 16
 
0.8%
richard 15
 
0.7%
lee 15
 
0.7%
george 14
 
0.7%
steven 14
 
0.7%
alfred 14
 
0.7%
Other values (882) 1879
90.7%
2025-09-02T10:50:55.484606image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 1209
 
9.0%
a 1126
 
8.4%
1073
 
8.0%
n 950
 
7.1%
r 917
 
6.8%
o 851
 
6.3%
i 834
 
6.2%
l 543
 
4.0%
s 497
 
3.7%
t 433
 
3.2%
Other values (59) 5039
37.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 10223
75.9%
Uppercase Letter 2107
 
15.6%
Space Separator 1073
 
8.0%
Other Punctuation 43
 
0.3%
Dash Punctuation 26
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 1209
11.8%
a 1126
11.0%
n 950
 
9.3%
r 917
 
9.0%
o 851
 
8.3%
i 834
 
8.2%
l 543
 
5.3%
s 497
 
4.9%
t 433
 
4.2%
h 404
 
4.0%
Other values (26) 2459
24.1%
Uppercase Letter
ValueCountFrequency (%)
S 179
 
8.5%
A 171
 
8.1%
M 166
 
7.9%
J 162
 
7.7%
C 142
 
6.7%
R 131
 
6.2%
H 110
 
5.2%
B 106
 
5.0%
T 102
 
4.8%
D 99
 
4.7%
Other values (19) 739
35.1%
Other Punctuation
ValueCountFrequency (%)
. 41
95.3%
' 2
 
4.7%
Space Separator
ValueCountFrequency (%)
1073
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 12330
91.5%
Common 1142
 
8.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 1209
 
9.8%
a 1126
 
9.1%
n 950
 
7.7%
r 917
 
7.4%
o 851
 
6.9%
i 834
 
6.8%
l 543
 
4.4%
s 497
 
4.0%
t 433
 
3.5%
h 404
 
3.3%
Other values (55) 4566
37.0%
Common
ValueCountFrequency (%)
1073
94.0%
. 41
 
3.6%
- 26
 
2.3%
' 2
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13421
99.6%
None 51
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 1209
 
9.0%
a 1126
 
8.4%
1073
 
8.0%
n 950
 
7.1%
r 917
 
6.8%
o 851
 
6.3%
i 834
 
6.2%
l 543
 
4.0%
s 497
 
3.7%
t 433
 
3.2%
Other values (46) 4988
37.2%
None
ValueCountFrequency (%)
ó 10
19.6%
á 9
17.6%
é 8
15.7%
ñ 7
13.7%
ô 5
9.8%
ö 3
 
5.9%
ç 2
 
3.9%
Ö 2
 
3.9%
Ô 1
 
2.0%
Ç 1
 
2.0%
Other values (3) 3
 
5.9%

Star1
Text

Distinct659
Distinct (%)66.0%
Missing0
Missing (%)0.0%
Memory size70.3 KiB
2025-09-02T10:50:56.912756image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length25
Median length21
Mean length13.005005
Min length4

Characters and Unicode

Total characters12992
Distinct characters72
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique502 ?
Unique (%)50.3%

Sample

1st rowMarlon Brando
2nd rowChristian Bale
3rd rowAl Pacino
4th rowHenry Fonda
5th rowElijah Wood
ValueCountFrequency (%)
tom 22
 
1.1%
daniel 17
 
0.8%
robert 17
 
0.8%
john 16
 
0.8%
khan 16
 
0.8%
james 15
 
0.7%
michael 12
 
0.6%
hanks 12
 
0.6%
ethan 11
 
0.5%
de 11
 
0.5%
Other values (1112) 1898
92.7%
2025-09-02T10:50:58.572326image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 1239
 
9.5%
e 1088
 
8.4%
1048
 
8.1%
n 951
 
7.3%
r 816
 
6.3%
i 794
 
6.1%
o 767
 
5.9%
l 590
 
4.5%
t 453
 
3.5%
s 438
 
3.4%
Other values (62) 4808
37.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 9784
75.3%
Uppercase Letter 2099
 
16.2%
Space Separator 1048
 
8.1%
Dash Punctuation 32
 
0.2%
Other Punctuation 29
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 1239
12.7%
e 1088
11.1%
n 951
9.7%
r 816
 
8.3%
i 794
 
8.1%
o 767
 
7.8%
l 590
 
6.0%
t 453
 
4.6%
s 438
 
4.5%
h 424
 
4.3%
Other values (29) 2224
22.7%
Uppercase Letter
ValueCountFrequency (%)
C 187
 
8.9%
M 172
 
8.2%
J 144
 
6.9%
D 142
 
6.8%
B 142
 
6.8%
S 141
 
6.7%
R 126
 
6.0%
A 115
 
5.5%
H 106
 
5.1%
T 104
 
5.0%
Other values (19) 720
34.3%
Other Punctuation
ValueCountFrequency (%)
. 19
65.5%
' 10
34.5%
Space Separator
ValueCountFrequency (%)
1048
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 11883
91.5%
Common 1109
 
8.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 1239
 
10.4%
e 1088
 
9.2%
n 951
 
8.0%
r 816
 
6.9%
i 794
 
6.7%
o 767
 
6.5%
l 590
 
5.0%
t 453
 
3.8%
s 438
 
3.7%
h 424
 
3.6%
Other values (58) 4323
36.4%
Common
ValueCountFrequency (%)
1048
94.5%
- 32
 
2.9%
. 19
 
1.7%
' 10
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12937
99.6%
None 55
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 1239
 
9.6%
e 1088
 
8.4%
1048
 
8.1%
n 951
 
7.4%
r 816
 
6.3%
i 794
 
6.1%
o 767
 
5.9%
l 590
 
4.6%
t 453
 
3.5%
s 438
 
3.4%
Other values (45) 4753
36.7%
None
ValueCountFrequency (%)
ô 13
23.6%
é 7
12.7%
ü 6
10.9%
í 6
10.9%
û 4
 
7.3%
ö 4
 
7.3%
è 3
 
5.5%
å 2
 
3.6%
ë 2
 
3.6%
Ç 1
 
1.8%
Other values (7) 7
12.7%

Star2
Text

Distinct840
Distinct (%)84.1%
Missing0
Missing (%)0.0%
Memory size70.7 KiB
2025-09-02T10:50:59.703418image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length25
Median length22
Mean length13.122122
Min length4

Characters and Unicode

Total characters13109
Distinct characters69
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique728 ?
Unique (%)72.9%

Sample

1st rowAl Pacino
2nd rowHeath Ledger
3rd rowRobert De Niro
4th rowLee J. Cobb
5th rowViggo Mortensen
ValueCountFrequency (%)
john 21
 
1.0%
robert 16
 
0.8%
lee 13
 
0.6%
michael 13
 
0.6%
emma 10
 
0.5%
james 9
 
0.4%
chris 9
 
0.4%
george 8
 
0.4%
tom 8
 
0.4%
jack 8
 
0.4%
Other values (1388) 1940
94.4%
2025-09-02T10:51:01.212693image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 1319
 
10.1%
e 1180
 
9.0%
1056
 
8.1%
n 949
 
7.2%
r 885
 
6.8%
i 785
 
6.0%
o 719
 
5.5%
l 579
 
4.4%
t 483
 
3.7%
s 432
 
3.3%
Other values (59) 4722
36.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 9904
75.6%
Uppercase Letter 2101
 
16.0%
Space Separator 1056
 
8.1%
Dash Punctuation 24
 
0.2%
Other Punctuation 24
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 1319
13.3%
e 1180
11.9%
n 949
9.6%
r 885
8.9%
i 785
 
7.9%
o 719
 
7.3%
l 579
 
5.8%
t 483
 
4.9%
s 432
 
4.4%
h 375
 
3.8%
Other values (28) 2198
22.2%
Uppercase Letter
ValueCountFrequency (%)
M 194
 
9.2%
S 161
 
7.7%
J 158
 
7.5%
C 136
 
6.5%
A 124
 
5.9%
B 122
 
5.8%
R 122
 
5.8%
H 108
 
5.1%
K 105
 
5.0%
D 105
 
5.0%
Other values (17) 766
36.5%
Other Punctuation
ValueCountFrequency (%)
. 15
62.5%
' 9
37.5%
Space Separator
ValueCountFrequency (%)
1056
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 12005
91.6%
Common 1104
 
8.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 1319
 
11.0%
e 1180
 
9.8%
n 949
 
7.9%
r 885
 
7.4%
i 785
 
6.5%
o 719
 
6.0%
l 579
 
4.8%
t 483
 
4.0%
s 432
 
3.6%
h 375
 
3.1%
Other values (55) 4299
35.8%
Common
ValueCountFrequency (%)
1056
95.7%
- 24
 
2.2%
. 15
 
1.4%
' 9
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13046
99.5%
None 63
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 1319
 
10.1%
e 1180
 
9.0%
1056
 
8.1%
n 949
 
7.3%
r 885
 
6.8%
i 785
 
6.0%
o 719
 
5.5%
l 579
 
4.4%
t 483
 
3.7%
s 432
 
3.3%
Other values (45) 4659
35.7%
None
ValueCountFrequency (%)
é 19
30.2%
ô 9
14.3%
ö 7
 
11.1%
ç 6
 
9.5%
í 5
 
7.9%
ü 5
 
7.9%
è 3
 
4.8%
Ö 2
 
3.2%
á 2
 
3.2%
ó 1
 
1.6%
Other values (4) 4
 
6.3%

Star3
Text

Distinct890
Distinct (%)89.1%
Missing0
Missing (%)0.0%
Memory size70.4 KiB
2025-09-02T10:51:02.242563image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length27
Median length21
Mean length13.283283
Min length4

Characters and Unicode

Total characters13270
Distinct characters73
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique808 ?
Unique (%)80.9%

Sample

1st rowJames Caan
2nd rowAaron Eckhart
3rd rowRobert Duvall
4th rowMartin Balsam
5th rowIan McKellen
ValueCountFrequency (%)
john 21
 
1.0%
robert 16
 
0.8%
michael 13
 
0.6%
richard 12
 
0.6%
christopher 9
 
0.4%
jack 8
 
0.4%
paul 8
 
0.4%
george 7
 
0.3%
lee 7
 
0.3%
harris 7
 
0.3%
Other values (1460) 1951
94.8%
2025-09-02T10:51:03.824819image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 1290
 
9.7%
e 1152
 
8.7%
1060
 
8.0%
n 925
 
7.0%
i 894
 
6.7%
r 864
 
6.5%
o 757
 
5.7%
l 626
 
4.7%
t 443
 
3.3%
s 423
 
3.2%
Other values (63) 4836
36.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 10057
75.8%
Uppercase Letter 2097
 
15.8%
Space Separator 1060
 
8.0%
Other Punctuation 33
 
0.2%
Dash Punctuation 23
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 1290
12.8%
e 1152
11.5%
n 925
9.2%
i 894
 
8.9%
r 864
 
8.6%
o 757
 
7.5%
l 626
 
6.2%
t 443
 
4.4%
s 423
 
4.2%
h 392
 
3.9%
Other values (33) 2291
22.8%
Uppercase Letter
ValueCountFrequency (%)
M 189
 
9.0%
J 158
 
7.5%
S 156
 
7.4%
C 142
 
6.8%
R 141
 
6.7%
B 139
 
6.6%
A 119
 
5.7%
G 113
 
5.4%
H 112
 
5.3%
K 107
 
5.1%
Other values (16) 721
34.4%
Other Punctuation
ValueCountFrequency (%)
. 23
69.7%
' 10
30.3%
Space Separator
ValueCountFrequency (%)
1060
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 12154
91.6%
Common 1116
 
8.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 1290
 
10.6%
e 1152
 
9.5%
n 925
 
7.6%
i 894
 
7.4%
r 864
 
7.1%
o 757
 
6.2%
l 626
 
5.2%
t 443
 
3.6%
s 423
 
3.5%
h 392
 
3.2%
Other values (59) 4388
36.1%
Common
ValueCountFrequency (%)
1060
95.0%
. 23
 
2.1%
- 23
 
2.1%
' 10
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13219
99.6%
None 51
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 1290
 
9.8%
e 1152
 
8.7%
1060
 
8.0%
n 925
 
7.0%
i 894
 
6.8%
r 864
 
6.5%
o 757
 
5.7%
l 626
 
4.7%
t 443
 
3.4%
s 423
 
3.2%
Other values (45) 4785
36.2%
None
ValueCountFrequency (%)
é 11
21.6%
ü 5
9.8%
á 5
9.8%
í 4
 
7.8%
ô 4
 
7.8%
û 4
 
7.8%
ó 3
 
5.9%
å 2
 
3.9%
ö 2
 
3.9%
ç 2
 
3.9%
Other values (8) 9
17.6%

Star4
Text

Distinct938
Distinct (%)93.9%
Missing0
Missing (%)0.0%
Memory size71.0 KiB
2025-09-02T10:51:04.801519image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Length

Max length27
Median length23
Mean length13.211211
Min length4

Characters and Unicode

Total characters13198
Distinct characters73
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique881 ?
Unique (%)88.2%

Sample

1st rowDiane Keaton
2nd rowMichael Caine
3rd rowDiane Keaton
4th rowJohn Fiedler
5th rowOrlando Bloom
ValueCountFrequency (%)
john 25
 
1.2%
michael 15
 
0.7%
james 12
 
0.6%
lee 9
 
0.4%
richard 9
 
0.4%
mark 8
 
0.4%
bill 8
 
0.4%
martin 7
 
0.3%
charles 7
 
0.3%
kim 7
 
0.3%
Other values (1557) 1960
94.8%
2025-09-02T10:51:06.033555image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 1282
 
9.7%
e 1127
 
8.5%
1068
 
8.1%
n 903
 
6.8%
r 901
 
6.8%
i 861
 
6.5%
o 710
 
5.4%
l 631
 
4.8%
s 445
 
3.4%
t 419
 
3.2%
Other values (63) 4851
36.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 9956
75.4%
Uppercase Letter 2113
 
16.0%
Space Separator 1068
 
8.1%
Dash Punctuation 32
 
0.2%
Other Punctuation 29
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 1282
12.9%
e 1127
11.3%
n 903
 
9.1%
r 901
 
9.0%
i 861
 
8.6%
o 710
 
7.1%
l 631
 
6.3%
s 445
 
4.5%
t 419
 
4.2%
h 409
 
4.1%
Other values (30) 2268
22.8%
Uppercase Letter
ValueCountFrequency (%)
M 200
 
9.5%
S 173
 
8.2%
B 166
 
7.9%
J 161
 
7.6%
R 135
 
6.4%
C 134
 
6.3%
A 117
 
5.5%
K 112
 
5.3%
D 104
 
4.9%
L 101
 
4.8%
Other values (19) 710
33.6%
Other Punctuation
ValueCountFrequency (%)
. 20
69.0%
' 9
31.0%
Space Separator
ValueCountFrequency (%)
1068
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 12069
91.4%
Common 1129
 
8.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 1282
 
10.6%
e 1127
 
9.3%
n 903
 
7.5%
r 901
 
7.5%
i 861
 
7.1%
o 710
 
5.9%
l 631
 
5.2%
s 445
 
3.7%
t 419
 
3.5%
h 409
 
3.4%
Other values (59) 4381
36.3%
Common
ValueCountFrequency (%)
1068
94.6%
- 32
 
2.8%
. 20
 
1.8%
' 9
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13136
99.5%
None 62
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 1282
 
9.8%
e 1127
 
8.6%
1068
 
8.1%
n 903
 
6.9%
r 901
 
6.9%
i 861
 
6.6%
o 710
 
5.4%
l 631
 
4.8%
s 445
 
3.4%
t 419
 
3.2%
Other values (46) 4789
36.5%
None
ValueCountFrequency (%)
é 12
19.4%
ô 11
17.7%
ö 9
14.5%
û 5
8.1%
á 4
 
6.5%
è 3
 
4.8%
ø 2
 
3.2%
å 2
 
3.2%
Á 2
 
3.2%
ë 2
 
3.2%
Other values (7) 10
16.1%

Votes
Real number (ℝ)

High correlation 

Distinct998
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean271621.42
Minimum25088
Maximum2303232
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2025-09-02T10:51:06.415986image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum25088
5-th percentile29680
Q155471.5
median138356
Q3373167.5
95-th percentile939289.9
Maximum2303232
Range2278144
Interquartile range (IQR)317696

Descriptive statistics

Standard deviation320912.62
Coefficient of variation (CV)1.1814702
Kurtosis6.041324
Mean271621.42
Median Absolute Deviation (MAD)98475
Skewness2.1943511
Sum2.713498 × 108
Variance1.0298491 × 1011
MonotonicityNot monotonic
2025-09-02T10:51:07.099206image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
65341 2
 
0.2%
171640 1
 
0.1%
699256 1
 
0.1%
32802 1
 
0.1%
93878 1
 
0.1%
1213505 1
 
0.1%
51853 1
 
0.1%
1642758 1
 
0.1%
2067042 1
 
0.1%
1854740 1
 
0.1%
Other values (988) 988
98.9%
ValueCountFrequency (%)
25088 1
0.1%
25198 1
0.1%
25229 1
0.1%
25312 1
0.1%
25344 1
0.1%
25938 1
0.1%
26337 1
0.1%
26402 1
0.1%
26429 1
0.1%
26457 1
0.1%
ValueCountFrequency (%)
2303232 1
0.1%
2067042 1
0.1%
1854740 1
0.1%
1826188 1
0.1%
1809221 1
0.1%
1676426 1
0.1%
1661481 1
0.1%
1642758 1
0.1%
1620367 1
0.1%
1516346 1
0.1%

Revenue
Real number (ℝ)

High correlation  Missing 

Distinct822
Distinct (%)99.0%
Missing169
Missing (%)16.9%
Infinite0
Infinite (%)0.0%
Mean68082574
Minimum1305
Maximum9.3666222 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size7.9 KiB
2025-09-02T10:51:07.808906image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Quantile statistics

Minimum1305
5-th percentile139783.9
Q13245338.5
median23457440
Q380876340
95-th percentile2.9163069 × 108
Maximum9.3666222 × 108
Range9.3666092 × 108
Interquartile range (IQR)77631002

Descriptive statistics

Standard deviation1.0980755 × 108
Coefficient of variation (CV)1.6128584
Kurtosis13.894054
Mean68082574
Median Absolute Deviation (MAD)22698854
Skewness3.1277452
Sum5.6508537 × 1010
Variance1.2057699 × 1016
MonotonicityNot monotonic
2025-09-02T10:51:08.428728image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4360000 5
 
0.5%
5321508 2
 
0.2%
5450000 2
 
0.2%
9600000 2
 
0.2%
25000000 2
 
0.2%
216540909 1
 
0.1%
49530280 1
 
0.1%
78756177 1
 
0.1%
292576195 1
 
0.1%
30500000 1
 
0.1%
Other values (812) 812
81.3%
(Missing) 169
 
16.9%
ValueCountFrequency (%)
1305 1
0.1%
3296 1
0.1%
3600 1
0.1%
6013 1
0.1%
6460 1
0.1%
7461 1
0.1%
8060 1
0.1%
10177 1
0.1%
10950 1
0.1%
12562 1
0.1%
ValueCountFrequency (%)
936662225 1
0.1%
858373000 1
0.1%
760507625 1
0.1%
678815482 1
0.1%
659325379 1
0.1%
623279547 1
0.1%
608581744 1
0.1%
534858444 1
0.1%
532177324 1
0.1%
448139099 1
0.1%

Interactions

2025-09-02T10:50:35.349093image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:12.973071image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:16.189493image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:19.235406image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:22.589721image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:27.877441image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:31.711385image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:35.717980image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:13.471547image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:16.610586image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:19.731784image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:23.055235image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:28.375988image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:32.450575image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:36.121706image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:13.884053image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:16.990202image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:20.228400image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:23.607331image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:28.871128image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:32.976689image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:36.559927image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:14.267839image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:17.449112image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:20.633843image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:24.111804image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:29.451328image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:33.407118image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:36.953731image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:14.778894image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:17.913319image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:21.254116image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:26.613539image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:30.084685image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:34.079100image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:37.424945image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:15.233812image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:18.459704image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:21.682960image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:27.048199image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:30.541556image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:34.506707image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:37.813091image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:15.694194image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:18.874878image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:22.149118image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:27.531590image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:31.102746image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
2025-09-02T10:50:34.973382image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/

Correlations

2025-09-02T10:51:08.829622image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
CertificateRatingRevenueRuntimeUnnamed: 0VotesYearscoreAvg
Certificate1.0000.0000.0630.1410.0720.0570.3020.088
Rating0.0001.000-0.0500.210-0.9920.212-0.1270.285
Revenue0.063-0.0501.0000.1780.0360.7000.175-0.100
Runtime0.1410.2100.1781.000-0.2330.1570.194-0.090
Unnamed: 00.072-0.9920.036-0.2331.000-0.2450.012-0.259
Votes0.0570.2120.7000.157-0.2451.0000.255-0.073
Year0.302-0.1270.1750.1940.0120.2551.000-0.264
scoreAvg0.0880.285-0.100-0.090-0.259-0.073-0.2641.000

Missing values

2025-09-02T10:50:38.345812image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
A simple visualization of nullity by column.
2025-09-02T10:50:38.958553image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2025-09-02T10:50:39.526441image/svg+xmlMatplotlib v3.10.0, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Unnamed: 0TitleYearCertificateRuntimeGenreRatingOverviewscoreAvgDirectorStar1Star2Star3Star4VotesRevenue
01The Godfather1972A175Crime, Drama9.2An organized crime dynasty's aging patriarch transfers control of his clandestine empire to his reluctant son.100Francis Ford CoppolaMarlon BrandoAl PacinoJames CaanDiane Keaton1620367134966411.0
12The Dark Knight2008UA152Action, Crime, Drama9.0When the menace known as the Joker wreaks havoc and chaos on the people of Gotham, Batman must accept one of the greatest psychological and physical tests of his ability to fight injustice.84Christopher NolanChristian BaleHeath LedgerAaron EckhartMichael Caine2303232534858444.0
23The Godfather: Part II1974A202Crime, Drama9.0The early life and career of Vito Corleone in 1920s New York City is portrayed, while his son, Michael, expands and tightens his grip on the family crime syndicate.90Francis Ford CoppolaAl PacinoRobert De NiroRobert DuvallDiane Keaton112995257300000.0
3412 Angry Men1957U96Crime, Drama9.0A jury holdout attempts to prevent a miscarriage of justice by forcing his colleagues to reconsider the evidence.96Sidney LumetHenry FondaLee J. CobbMartin BalsamJohn Fiedler6898454360000.0
45The Lord of the Rings: The Return of the King2003U201Action, Adventure, Drama8.9Gandalf and Aragorn lead the World of Men against Sauron's army to draw his gaze from Frodo and Sam as they approach Mount Doom with the One Ring.94Peter JacksonElijah WoodViggo MortensenIan McKellenOrlando Bloom1642758377845905.0
56Pulp Fiction1994A154Crime, Drama8.9The lives of two mob hitmen, a boxer, a gangster and his wife, and a pair of diner bandits intertwine in four tales of violence and redemption.94Quentin TarantinoJohn TravoltaUma ThurmanSamuel L. JacksonBruce Willis1826188107928762.0
67Schindler's List1993A195Biography, Drama, History8.9In German-occupied Poland during World War II, industrialist Oskar Schindler gradually becomes concerned for his Jewish workforce after witnessing their persecution by the Nazis.94Steven SpielbergLiam NeesonRalph FiennesBen KingsleyCaroline Goodall121350596898818.0
78Inception2010UA148Action, Adventure, Sci-Fi8.8A thief who steals corporate secrets through the use of dream-sharing technology is given the inverse task of planting an idea into the mind of a C.E.O.74Christopher NolanLeonardo DiCaprioJoseph Gordon-LevittElliot PageKen Watanabe2067042292576195.0
89Fight Club1999A139Drama8.8An insomniac office worker and a devil-may-care soapmaker form an underground fight club that evolves into something much, much more.66David FincherBrad PittEdward NortonMeat LoafZach Grenier185474037030102.0
910The Lord of the Rings: The Fellowship of the Ring2001U178Action, Adventure, Drama8.8A meek Hobbit from the Shire and eight companions set out on a journey to destroy the powerful One Ring and save Middle-earth from the Dark Lord Sauron.92Peter JacksonElijah WoodIan McKellenOrlando BloomSean Bean1661481315544750.0
Unnamed: 0TitleYearCertificateRuntimeGenreRatingOverviewscoreAvgDirectorStar1Star2Star3Star4VotesRevenue
989990Giù la testa1971PG157Drama, War, Western7.6A low-life bandit and an I.R.A. explosives expert rebel against the government and become heroes of the Mexican Revolution.77Sergio LeoneRod SteigerJames CoburnRomolo ValliMaria Monti30144696690.0
990991Kelly's Heroes1970GP144Adventure, Comedy, War7.6A group of U.S. soldiers sneaks across enemy lines to get their hands on a secret stash of Nazi treasure.50Brian G. HuttonClint EastwoodTelly SavalasDon RicklesCarroll O'Connor453381378435.0
991992The Jungle Book1967U78Animation, Adventure, Family7.6Bagheera the Panther and Baloo the Bear have a difficult time trying to convince a boy to leave the jungle for human civilization.65Wolfgang ReithermanPhil HarrisSebastian CabotLouis PrimaBruce Reitherman166409141843612.0
992993Blowup1966A111Drama, Mystery, Thriller7.6A fashion photographer unknowingly captures a death on film after following two lovers in a park.82Michelangelo AntonioniDavid HemmingsVanessa RedgraveSarah MilesJohn Castle56513NaN
993994A Hard Day's Night1964U87Comedy, Music, Musical7.6Over two "typical" days in the life of The Beatles, the boys struggle to keep themselves and Sir Paul McCartney's mischievous grandfather in check while preparing for a live television performance.96Richard LesterJohn LennonPaul McCartneyGeorge HarrisonRingo Starr4035113780024.0
994995Breakfast at Tiffany's1961A115Comedy, Drama, Romance7.6A young New York socialite becomes interested in a young man who has moved into her apartment building, but her past threatens to get in the way.76Blake EdwardsAudrey HepburnGeorge PeppardPatricia NealBuddy Ebsen166544NaN
995996Giant1956G201Drama, Western7.6Sprawling epic covering the life of a Texas cattle rancher and his family and associates.84George StevensElizabeth TaylorRock HudsonJames DeanCarroll Baker34075NaN
996997From Here to Eternity1953Passed118Drama, Romance, War7.6In Hawaii in 1941, a private is cruelly punished for not boxing on his unit's team, while his captain's wife and second-in-command are falling in love.85Fred ZinnemannBurt LancasterMontgomery CliftDeborah KerrDonna Reed4337430500000.0
997998Lifeboat1944NaN97Drama, War7.6Several survivors of a torpedoed merchant ship in World War II find themselves in the same lifeboat with one of the crew members of the U-boat that sank their ship.78Alfred HitchcockTallulah BankheadJohn HodiakWalter SlezakWilliam Bendix26471NaN
998999The 39 Steps1935NaN86Crime, Mystery, Thriller7.6A man in London tries to help a counter-espionage Agent. But when the Agent is killed, and the man stands accused, he must go on the run to save himself and stop a spy ring which is trying to steal top secret information.93Alfred HitchcockRobert DonatMadeleine CarrollLucie MannheimGodfrey Tearle51853NaN